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ABSTRACT - An upper bound an the average error probability for 
maximum- likelihood decoding of the ensemble of random binary tres 
codes is derived and shown to be independent of the length of the 
tree. 

An upper bound on the average error probability for maximum-likelihood 
decoding of the ensemble of random L-branch binary trellis codes of 
rate R a 1/n • ? s derived which separates the effects of the tail length 
T and the mem. / length M of the code. It is shown that the bound is 
independent of tne length L of the information sequence when 
M T + f n Ey|jCR)] ^ log2L. This implication is investigated by compu- 
ter simulations of sequential decoding utilizing the stack algorithm. 
These simulations confirm the implication and further suggest the follo- 
wing empirical formula for the true undetected decoding error probabili- 
ty with sequential decoding: 

* 

P|>] « c2 n ^ R T , provided M >_ T ♦ [nE^CRl] ^ log^L 

where c is a constant independent of L, T and M, The exponent R^ is 
related to Viterbi's upper and lower exponents for the ensemble of 
time-varying convolutional codes by the inequality: 

Eyy(R) <_ Ry <_ Ey^(R). 

11 The first draft of this paper was presented orally at the 

IEEE International Symposium on Information Theory, Ashkelon, 

Israel, June 25-29, 1973. 
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ABSTRACT - An upper bound on the average error probability for 
maximum- likelihood decoding of the ensemble of random binary tree 
codes is derived and shown to be independent of the length of the 
tree. 

An upper bound on tho average error probability for max imun- likelihood 
decoding of the ensemble of random L-brarch binary trellis codes of 
rate R • 1/n *s derived which separates the effects of the tail length 
T and the men . / length M of the code. It is shown that the bound is 
independent o* tne length L of the information sequence when 
M >. T ♦ [nE V u(R)] 1 log^L. This implication is investigated by compu- 
ter simulations of sequential decoding utilizing the stack algorithm. 
These simulations confirm the implication and further suggest the follo- 
wing empirical formula for the true undetected decoding error probabili- 
ty with sequential decoding: 

P[c] » c2 nTR T , provided M >_ T ♦ [nE^CR)] 1 log^L 

where c is a constant independent of l, T and M. The exponent Ry is 
related to Viterbi's upper and lower exponents for the ensemble of 
time-varying convolutional codes by the inequality: 

E VU «>i«TiEvL tR, ‘ 

1} The first draft of this paper was presented orally at the 

IEEE International Symposium on Information Theory, Ashkelon, 

Israel, Juno 25-29, 1973. 
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I, INTRODUCTION 

i i i r-r. - r * 

Massey [ij has recently defined the class of random tree codes as a 
generalization of the type of convolutional code used in sequential 
decoding and has defined the class of random trellis codes as a 
generalization of the type of convolutional code used in Viterbi 
decoding. He has also proved random upper bounds on the average pro- 
bability of error for maximum-likelihood decoding of these codes for 
codes rates less than Rg, where Rq is less than capacity C of. the 
channel. 

In Section III of this paper, we extend Massey's bound for tree codes 
to all rates lass than capacity. In Section IV we do the same for his 
bound on trellis codes but we do this in the context of a more gene- 
ral class of trellis codes for which a distinction can be made bet- 
ween its "memory length" and its "tail length". The bounds obtained 
suggest that it is advantageous to use a memory length which is a 
specified amount greater than the tail length, this amount depending 
on the length of the trellis. In Section V, we report sequential de- 
coding simulations which confirm this suggestion and which should be 

useful guides in the design of future sequential decoding systems. 

> 

These simulations suggest an empirical formula for the true decoding 

error probability with sequential decoding which we give in Section VI, 

i 

. | i 

t 

To rslate the more general codes used in this paper to the special 
cases used in practice, we note that convolutional codes constitute 
the class of linear tree codes. After L information bits have been 
encoded in the convolutional code, the encoded sequence is terminated 
after a "tail" of T information zeroes has been encoded. A convolutio- 
nal code is further characterized by its encoding memory length M 
which is the number of unit delays in the encoder. The usual practice 
has been to take M°T but the merit of removing this restriction will 
become obvious in the sequel. 


— t 
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II. PRELIMINARIES 

In our discussion of tree and trellis codes, we shall restrict our 
attention to codes of rate R » 1/n both for simplicity of descrip- 
tion and becauss these rates are those of the most practical inte- 
rest. 

A binary tree code of rate R ° 1/n, tree length L, and tail length T 
is formed by assigning n channel input symbols to each branch of a 
rooted tree such that 2 branches stem from the root node, 2 brandies 
stem from each successive node at depth i from the root for i < L, 
and a single branch stems from each node at depth i from the root 

i 

for L <, i < L+T. We show such a tree codB in Figure 1 for T°3, n a 2, 
and a binary input. channel. To encode a binary information sequence 
of length L with such a code, one begins at the root node and moves 
through th?j tree taking the upper branch or the lower branch accor- 
ding as each successive information digit is 0 or 1. Since n(L+T) 
encoded digits result from this process, the true rate (in bits per 
channel input symbol) of the code is L/[n(L+T)J and hence approxima- 
tely equal to its supremum R ° 1/n only when L > > T. For> this reason, 
one ordinarily desires to choose L > > T. Wb shall call those nodes 
where the tree divides "information nodes" to correspond with their 
function in encoding the binary information sequence of length L. 

In this paper we shall derive an upper bound on the average decoding 
error probability, assuming maximum-likelihood decoding, for the 
ensemble of R, L, T tree codes in which each channel input symbol in 
the tree is chosen independently according to a specified probability 
distribution. 
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III. UPPER BOUND FOR RANDOM TREE CODES. 

For the binary tree codas described in II, wa shall now derive an 
upper bound on the average probability of decoding error, assuming 
that a maximum- likelihood decoder is used. 

Suppose that L information bits have been encoded and that thB resul 
tant n(L+T) encoded symbols have bean transmitted through a discrete 
memoryless channel. Let (i *> 1,2,...,L) bB the event that the 
probability of the received sequence given some incorrect path stem- 
ming from the i-th last information node along the correct path is 
equal to or greater than the probability of the received sequence 
given the correct path. Then, lBtting e be the event that a decoding 
error- is made by the maximum-likelihood decoder, we have 

e c E 1 U E 2 U ... U E l • Cl) 

Cwher8 we are prevented from writing " n " rather than "c" only by 
the fact that the maximum-likelihood decoder may correctly decode 

in case of ties for the best path). The average probability of 

* 

error over the ensemble of R, L, T tree codes is then bounded by 
the union bound 


P[<0 < * P[E 2 ] ♦ ... - P[E l ] . ■ (2) 

i * 

We now note that a total of 2 incorrect paths, each of length 

T+i branches, stem from the i-th last information node on the . 
correct path. Since over the ensemble of codes all symbols on each 
path are mutually independent, the symbols on the transmitted path 
stemming from the i-th .last node are independent of thB symbols on 
any incorrect path stemming from the 'same node. Thus, we may use 
Gallager's random coding upper bound on block codes of length n-(T + i) 
with (2 +1) codewords [2j to bound the i-th term on the righthand 

side of (2) as 


p[E.] < 2 a ' 1)p 2 _na+aE o a ^ 


9 


0 < p < 1 


(3) 
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where p is a parameter which can be chosen later. The function Eq(p) 
is defined as 

E n (p) » max {-log 7 I Cl QM P J 1+P ) . (4) 

U PCx) * Y X 

where X and Y are the channel input and output spaces respectively. 

p(y|x) is the channel transition probability distribution, and Q(x) 

« 

is a probability distribution on the input space. ThB maximizing 
Q(x) in C 4 ) is then also taken as the specific probability distribu- 
tion defining the ensemble of R, L, T tree codes. 


Inserting (3) in (2) yields 
L 

P[c] < l 2 (1 ‘ l)p 2-"‘ T * ilE o‘ p > 

~~ i-1 

< 2 " nTE 0 (p5 2 _p l 2 _i t nE a (p) “ p] 

. . 2" nTE 0 (p) , 0 < p < 1 

1 - 2" n6 


(5) 


where 

« 

% 

5 *> EqCpD - pR > 0 , (6) 

* 

Since Eg(p) is a monotonically increasing function of p. we have 
Theorem 1 : 

The average probability of error for maximum-likelihood decoding 
of the ensemble of binary R ■ 1/n, L, T tree codes satisfies ■ 

C 7 ) 


P[e] < c2 _nTE VU CR) 
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where Eyg(R) is Viterbi's upper bound exponent [3], namely 


E vu‘ R > 


R, 


sup E n (p) 


0 < R c Rq 
R g < R < C 


( 0 ) 


where the supremum is taken over p such that 0 <_ p <_ 1 and 
EgCp) > p R and where 


c 


2~ nE VU 

1 - 2** n5 


CR) 


C 9 ) 


The "constant" c depends on R but is independent of L and T. The 
exponent Rg is defined to be Eq C 1 3 • This exponent Rg is numerically 
equal to the "computational cutoff rate R comp M encountered in 
sequential decoding. The ratB C is the channel capacity. 

•The remarkable feature of thB bound C 7 ) is its independence of the 
length L of the treeJ The bound C 7) implies that only the tail ' 
length T is important in determining the error probability for 
tree codes. In Section V# we report simulations which verify this 
conclusion. 

♦ 

Viterbi [5] has given the same upper bound on the first-event error 
probability for time-varying convolutional codes, 

We also wish to re rk that for rates R < R Q , Massey [l] derived 
an upper bound equal to ( 7 J » His argument used the two codeword 
exponent Rg and this work stimulated the investigation reported in 
this paper to extend the bound from Rg to C and to conduct simula- 
tions to verify its implications. 

Massey [b] also recently presented the straightforward generalization 
of his formula to rates R » k/n < Rg# where k and n are integers and 
suggested the use in principle of convolutional codes with memory 

• length greater than tail length to remove the dependence of P[e] on 
tree length L as has been carried out in the simulations imported in 
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Section V. Massey's generalised argument can be extended to show 
that (7) holds for all rates R *» k/n provided that the constant M c" 
in C7) is given by 


c 


C2 k -1 ) p 


2~ nE yj CR) 

1-2" ni 


( 10 ) 


rather than by (9). We have omitted this refinement because, the 
generalization while straightforward is somewhat awkward and 
because R » 1/n is the case of greatest practical interest. 
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IV. UPPER BOUNDS FOR TRELLIS CD0E5, 

In Section II# wo described an R » 1/n, L, T tree code as the 
assignment of n « 1/R channel input symbols to each branch of a 
particular rooted tree# Recall that a sequence of L information 
bits specifies prefl'iely one path through this tree. We now define 
an R, L, T, M trellis code to be a tree code with the property that 

<i 

Ci) if the preceeding M information digits on the paths 
leading into two nodes at the same depth in the tree 
coincide then the same further encoded sequence results 
. whenever the’ same further information sequence is app“ 
lied starting from either node and 

(ii) all digits are the same on the last M-T branches. 

In other words, the "memory" or dependence on the past information 
bits is limited to the M previous information bits but the useful 
"tail” of the tree is only T rather than M branches in length. From 
an encoding viewpoint then, nodes with same preceeding M information 

i 

bits can be "merged" in the tree so that the passible encoding 
paths may be shown as forming a "trellis-like structure". In Figure 
2, we show an R ■ 1/2 binary trellis code with T=1 and M=2 for a 
binary input channel. Forney [4j was the first to use the term 
"trellis" in connection with a special class of such codes (viz. con- 
volutional codes with M a T? while Massey [l] generalized thB definition 
to that given here except again for the restriction that M a T. By allo- 
wing T < M we are able, as shown in the sequel, to demarcate rather 
precisely the different effects of the "tail length" T and the "memory 
length" M an decoding error probability. 

» 

Our artifice of requiring all of the digits on the last M-T branches 
of each path in the trellis to coincide renders these digits "useless" 
and hence unneccBssary to transmit over thB discrete memoryless 
channel being considered and hence to have a true "tail" of length 
only T branches, but this artifice also allows us to usb with only 
slight change the bounding techniques normally used for the "usual" 
trellis codes with M"T. 

* i 



a. 


Suppose now that L information bits have been encoded in on 
R, L/ T, M trellis code and that first n(L+T) encoded digits on 
the corresponding path (i.e. all except the *0501833” last n(M-T) 
digits which are the same on all paths in the trellis) and n(L*T) 
corresponding digits have been received over the discrete memory- 
less channel. 

Consider next any subpath of the correct path in the trellis, Vie 
define an "adversary" for this subpath to be any path which has 
the same first node and "remarge" with this subpath at its last 
node, i.e. it has this same last node but no previous node in corrmon 
with this subpath (except of course the first node). By our defini- , 
tion of a trellis code, an adversary must haVB length at least M+1 
branches since after diverging with the correct path at sorpe node 
there must be some “l consecutive information bits that agree with 
those on the correct path for remBrgence to take place. 

A maximum-likelihood decoder for the trellis codB will decode correctly 
unless there is some subpath of the correct path such that the proba- 
bility of the corresponding portion of thB received sequence given 
some adversary of this subpath is as great or greater than its pro 4 * 
bability given the subpath. In case of ties for the best subpath thB 
maximum-likelihood decoder may decode correctly. Hence wb begin our 
bounding of the decoding error probability by defining F. (M < j <_ L+M) 

J 

as the event that for some subpath of the correct path ending at the 
j-th node from the root along thB correct path the corresponding por- 
tion of the received sequence is as probable or more probable given 
some adversary of the subpath than given the subpath. Letting e be 
the event that the decoding is not correct, we then have 

t 

c c i>j +1 U F|v| + 2 (J- • • • » U (11) 


Using the union bound, we can then overbound the average error pro- 
bability for the ensemble of trellis codes in which each digit in 

J 

the trellis is chosen independently according to some probability 
distribution Q( ) over the channel input space as 


H<0 iPpH.il 4 p IW 



+ 


• • t 


♦ 


( 12 ) 
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and wa can ovsrbound P[Fj] using the random coding bound for cer- 
tain ensembles of block codes as wo now consider in detail. For 
. tha node at depth j from the root along the correct path, there is 
only one adversary of a subpath of length M+1 branches which rcmer- 

Om 2 

ges at this node. For l >, 2, there are at most 2 adversaries of 
a subpath of length M+l which remerges at this node as can be seBn 
from the fact that the first information bit for the adversary must 
disagree with that of the subpath while the last M information bits 
must agree and the information bit just previous to these must 
disagree with those of the subpath or remergerice would have occurred 
sooner. Hence M+2 of tha M+& information bits of an adversary of 
length M+A are uniquely specified when l >. 2. Thus 2 max ^°'*'~ 2 ^ is a 
general upper bound on the number of adversaries for a subpath of 
length M+2. branches, 2 ^ 1. 

In considering nodes at depth j from the root, for M < j <_ L+T 
all digits on the adversaries remerging at this nodB are statistically 
independent of those on the corresponding correct subpath so that 
using Gallager's upper bound on random block codes [2] wa have 

t 

V 

< j 2 max(0,£.-2) Jp 2 -n(M+2.)E 0 (p) 

1 

* 

< (2" nE 0^ ♦ ?~ 2p T 2~ n<S1 ) 2~ nME 0^ 

£“2 

% 

. 2~ nE 0 Cp) „-nME n (p) 

< ■ ■ ' " 1 ■ * & u 

1-2 n6 

■ c2 _nM£ 0 Cp) , 0 < p < 1 , M < j <_ L+T (13) 



’ But for L+T < j <. L+M, the digits on the last (j-M+T) branches of 
each adversary agree with those on the correct path because of our 
artifice of using the same channel input letter for all digits on 
the last M-T branches of every path in the trellis. Thus the block 
coding bound must bo revised to account for tha reduced useful 
codeword length and wo obtain 
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< ^^nax(0,t-2) jp ^ntL+M-j+THjEgtp) 
i“1 

< 2 -n£L*M-j)E 0 (p) ^ 2 -nEg{p) 

» 

+ 2~^P 2 "°^^) 2* n ^0^^ 

A«2 

< z -n(L.N-J)E 0 (p) c2 -nTE 0 (p) ( 

0 < p < 1 , L+T < j <. L+M 


(14) 


whsre in both (13) and £14) 

5 D E Q (p) ~ pR > 0 (15) 

and where c is as given in (9), i.e. c is the same constant as 
in (7). 

Finally, substituting (13) and (14) in (12) wo have 


P[e] < L f c2" nl1E 0 (p) . 
j«M+1 

l 

+ 2 -n(L+M-j)E Q (p) o2 -nTE Q (p) 

j°L+T+1 

.-mc /-•) <1 _ 7 -n(M-T)E n (p) 

“ (L*T-M)c2 i - r - 

1-2 n ¥ p) 


c2 -n T E Q (p) ^ 


0 < p < 1 , T <.M 


(16) 


Since Eq(p) is a 'monotonically increasing Function of p, we have 

« 

Theorem 2: 

The average probability of error for maximum-likelihood decoding 
* of thB ensemble of binary R ■ 1/n, L, T, M trellis codes satisfies 
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> 0 ] < c2~ nTE VU CR) 


1-2~ n 

-1-2" n 


n(M-T)E vu (R) 


^RT 


* (L+T-M)2 


■nM-TJEyutRJj 


C 17 ) 


where EyyCR) is given in (fl) and T <,M. 
Upon observing that, since T <_ M, 


. 1-2 
1-2 


n(M-T)E vu (R) 

Ir T~ 


■nE, 


VU* 


< M-T 


(18) 


we can state 


Corollary 1; 

The average probability of error for maxirnum-likelihood decoding 
of the ensemble of binary R D 1/n, L,T, M codes satisfies 

P[c] < Lc2~ nTE VU (R) (19) 

where E^j(R) is given in (8). In the special case when T e l y l, the 
bound of Corollary 1 is identical to Viterbi's well-known upper . 
bound for the ensemble of time-varying convolutional codes. 

Next, we notice that thB first term within the brackets in (17) is 
independent of L whereas the second term can be trade arbitrarily 
small for a given L by increasing M, Thus, by choosing that value 
of M which, for a given L, makes these two terms equal we have 


Corollary 2 ; 


The average probability of error for. maximum-likelihood decoding of 
the ensemblB of binary R » 1/n, L, T, M cades satisfies 


< c 


2~nTEyu(R) 


/ 


(20) 



provided 


M >_ T + [nEy^CR)]" 1 log 2 L 

where 


( 21 ] 


c' 


2c_ 

l-2-^I 


and Eyyy(R] is given in (B). 


( 22 ] 


The bound (20) of Corollary 2 is independent of the length l of 
the trellis and , is very similar to the bound (7) of Theorem 1 for 
the ensemble of tree codes 


We remark that Theorem 2 and its corollaries can be proved to 
hold for the ensemble of time-varying convolutional codes, but 
not presently for the ensemble of constant convolutional codes 
which lack the independence needed in the proof. However, we con- 
jecture that Theorem 2 and its corollaries hold also for the en- 
semble of constant convolutional codes which in fact are the type 
of convolutional code that has always been used in practice. Since 
there must always be at least one code whose P[eJ is no more than 
average, ’wa can state an even weaker 


Conjecture: 

The probability of error for maximum-likelihood decoding of a "good" 
binary R ° 1/n, L, T, M constant convolutional code satisfies 


P[e] < c2“ nTE VU tR) (23) 

» " 

provided 

• M>T + [nE^CR)]” 1 log 2 L (2-1) 

t 

i 

where c is a constant independent of L, T and fl and Eyy(R) is 
given in (B). 
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The conjecture is given strong support by the simulations discussed 
in Section V which is not too surprising since all "Viterbi type" 
error bounds for convolutional codes can presently be proved only 
for random or time-varying codes but all simulations to date have 
used "good” constant codes and the bounds have always been found 
to bo valid, i.e. the actual P[c] f or the constant code considered 
was smaller than jhe upper bound on P[c] for the ensemble of time- 
varying codes of that length. 

Finally, we note that by taking M B L+T, the ensemble of R, T, M 
trellis codes becomes exactly the ensemble of R, L, T tree codes, 

We have already noted that for M°T, the ensemble of R, L, T, M trellis 
codes becomes the ensemble of trellis codes defined b^ Massey [YJ, 
Hence our Theorem 2 is a generalization from which upper bounds on 
P[e] for both these ensembles follow as special cases. 


i 


/ 
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V. RESULTS DF SIMULATIONS. 

In order to test the implications of tho bounds for trellis codes 
derived in the previous section and in particular to test our con- 
jecture that these bounds apply to "Rood'* constant convolutional 
codes, decoding simulations for the binary symmetric channel (BSC) 
were conducted. 

Although the theory was developed for true maximum- likelihood de- 
coding, it is well-known [7] that the exponent of error probability 
for sequential decoding is the same as that for true maximum-likelihood 
or "Viterbi" decoding. Since the latter is too time-consuming for 
practical simulations except when M is very small, it was decided to 
perform the simulations using sequential decoding. The particular se- 
quential decoding algorithm employed was the quantized or "stack 
bucket" algorithm proposed by Jelinek [fl] which is the practical 
modification of the "stack algorithm" conceived independently by 
Zigangirov [9] and Jelinek. The simulations were all performed for 

I 

thB code rate R » 1/2, The "good" convolutional codes chosen were 
the "complementary codes" found by Bahl and Jelinek [lOj. Three 
different BSC's WBrB simulated, namely those with "crossover probabi- 
lity" p of 0.033, 0.045 and 0.057 which correspond to R ■ 0.9 Rg, 

R “ Rg and^R = 1.1 respectively when R ° 1/2. For each code used 
on each of these channels, a very large number (up to 60,000) of 
received "frames", i.e, complete received sequences of length n(L+T), 
were decoded so that the decoding error probability could be accura- 
tely inferred. The "metrics" used for the sequential decoding on 
each BSC are tabulated in Table I. 

In Figures 3, 4 and 5, we give the simulation results for the sequen- 
tial decoding undetected error probability P[eJ as a function of the 
tail length T of the convolutional code, (Because of the extreme 
variability of the computation in sequential decoding when M is large, 
there were occasions where the decoding had to be stopped because the 
computation exceeded the alloted maximum. The observed probability 
of this "overflow" is tabulated in TablB II and had negligible effect 
on the curves of Figures 3, 4 and 5.) These curves show that the 
actual P[e] decreases exponential with T with an exponent very close 
to that of tho bound (20) for the range T < M - [nE^y(R)] -1 log 2 L+2 



while further increases in T beyond this point have virtually no 
of foot on P[e] . Th is is in surprising agreement with tho effoct 
of 1 and T on P[c] in the bound of (17), It is rathor remarkable 
that the range of T for which the bound bocomos independent of l, 
viz. T <. M- [nEyy(R)] log2b is so cIosb to the range where the 
true P[e] becomes independent of L. Hence the relation (21) can 
bo taken as a slightly conservative design rule for choosing M so 
•that Pfc] is reduced to as little as possible for the tail length 
T that can be allocated to an, encoded frame. 
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VI. AN EMPIRICAL FORMULA FOR Pfel WITH SEQUENTIAL DECODING. 

Tho curves of Figures 3, 4 and 5 for P[c] versus T obtained from 
our sequential decoding simulations are well approximated by two 
straight lines giving the exponential decrease of P[c] with T up 
to the point where P[cJ becomes independent of T, The following 
empirical formula then provides a close match to the undetected 
error probability for sequential decoding of the "good" convolutio- 
nal codes used in these simulations! 

P[ej sj c2 nTR T , provided 14 >_ T + [nE^CR)] ^ log 2 L * (25) 

where the observed values of c and Ry for each of the decoding simu- 
lations performed are given in Tables III, IV and V, The near con- 
stancy of these parameters for wide variations in !4 and L when fl >_ 6 
suggest that these parameters can be well estimated in advance and 
used for design of sequential decoding systems. The case M=*4 is a 
case where the memory length is so small that the exponential app- 
roximation is not very well fulfilled. In fact, the apparent slight 
variation of c and Ry for large values of L is probably related more 
to the inaccuracies of the statistical values because of the small 
but increasingly non-negligible overflow probability P^ (as given 

in Table II) rather than to an actual variation of c and R x . 

> l 

The average values of Ry evaluated over M _> 0 and over all L (four 
values exceeding 0.98 are omitted) are given together with Viterbi's 
upper and lower exponents E^tR) and Ey^(R) for R = 1/2 in *Table VI. 
The exponents Ey^(R) and Ey^(R) are shown in Figure 8 where straight- 
line approximations are used when Rg < R < C. % 

From Table VI we conclude that 

EyyCR) <. Ry <. Ey L (R). , R < Rq (28) 

Ry - E VU (R) - E VL CR) , R 0 £R < C (27) 

Thus, Ry is in agreement with both the exponent E^tR) of the 
upper bound (20) and the exponent E^(R) of Viterbi's lower bound 
on the error probability in decoding a time-varying convolutional 
. code [3j . 
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VII, REMARK. 

Finally, we should remark that, if we wanted solely to minimize 
tho undBtec d error probability with sequential decoding for a 
given memory length and was not concerned with holding the tail 
size to a minimum to maximize the true rate of tho trellis code 
then the optimal value of the tail length is, of course, thB memory 
length, i.e. T*N. Probably this fact has caused investigators to 
ignore tho distinction between the "tail" and the "memory" so that 
the merory length came to.be honoured for work actually done by 
the tail. , . 
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Table i 


ChannBl 

transition 

probability 

R o 

R / R 0 

Mb trie 

Code symbol R sceivBd symbol 

0 1 

p - 0.033 

0.5D 

0.69 

0 1 -10 

1 -10 1 

* 

p - 0.045 

0.50 

1.00 

0 1-9 

1 .-9 1 

p - 0.057 

0,45 

1.11 ‘ 

♦ 

0 1 ‘ - 9 

1 - 9 1 
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TABLE II 

» . i ■ ■ V li 

; f : ■ ! 

The probability of computational overflow P for thB seouential 

» j | ® * | 

decoding simulations, (lODQ decoded jframog) , [ j 


p 



0 

16 

32 

64 

96 

128 

160 

192 

224 

256 

mm 

Hi 

320 



■ 


mm m^^m 


B 

.001 

.001 

.001 

,001 

wm 

.001 




6 


Rfflj 

,000 

.000 


B 

.002 


B 

.008 

.009 


.009 

.033 

a 


a 

B 

■ 


,001 

.001 

.002 

.004 

,005 

m 

.007 


• 

12 


.000 


m 

B 

B 

.001 

.001 

Ea 

,006 

.003 

.008 

,009 


16 



,000 


.001 

.002 

.005 

.005 

IBS 

,010 

.013 

.014 

.016 


■ 

* 

.000 

.000 

.000 

,001 

.003 

.004 

.005 

.009 

,013 

,016 

B 

.026 


6 


.000 

.000 

gn 

m 

.002 

.009 

.012 

.022 

.030 

.036 

.038 

.049 

.045 

Q 



.000 

B 

.002 

.006 

.013 

.016 

.034 

Hi 

.052 

.057 

.071 


12 


wm 

.000 

.000 

.002 

.005 

.012 

.020 

,031 

.042 

.050 

.058 

.073 


16 


wmmm 

muni 

MB 

.000 

.000 

.002 

B 

' .012 

B 

.034 

,043 

.051 

.059 

.073 


■1 

it 

.000 

, 

.000 

B 

H| 

.015 

.021 

.031 

.052 

B 

U 

.106 

.127 


6 


.000 

.ODD 

.000 

.009 

.020 

.048 

.070 

,106 

.133 

.153 

— 

.176 

.218 

.057 

8 


.000 

.000 


.007 

,030 

.051 

.071 

.130 

.161 

.189 

‘.216 

.249 


12 


.000 

.000 

.000 

.007 

.026 

.057 

.078 

.123 

.152 

.180 

.215 

.255 


16 


,000 

.000 

.000 

.008 

,030 

.055 

.084 

,132 

.160 

.190 

.225 

.277 


1 

1 

i 
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TABLE III : 1 

ii 

Results of simulations at p 
(1000 decoded frames )♦ 


0.033 (R 


C.9R 0 ) 
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TABLE IV 

'Results of simulation at p ■ 0.045 (R ■ R n ) 

i :: u * 

(1000 decoded frames), 'ill 


fen 

mm 


16 

32 

64 

96 

128 

■n 

192 

224 

256 

208 

320 

■ 

c 

.127 

.100 

.172 

,139 

.208 

.234 

,252 

,276 

. 266 

D 

,305 

.344 

r t 

.52 

.50 

.45 

.36 

.32 

.27 

.21 

.16 

.13 

.16 

.13 

mm 

5 

c 

.101 

.159 

,150 

,154 

.159 

.172 

m 

.169 

.141 

.168 

« 

.170 

,179 

r t 

.57 

.50 

.50 

.50 

.46 

.40 

,50 

.41 

.41 

.46 

.33 

a 

0 

c 

.106 

.156 

.175 

.168 

.153 

.154 

.170 

.154 

,127 

.132 

,146 

.133 j 

r t 

.60 

.54 

.47 

.40 

.47 

.45 

.45 

.48 

,50 

.50 

.43 

a 

12 

c 

.039 

.157 

.163 

,150 

.154 

.164 

,150 

.154 

.104 

.141 

.145 

m 

R r 

.53 

.51 

.50 



.49 

.47 

. 53 

.49 

.46 

.51 

.51 

,46 

.48 

16 


,103 

.151 

.153 

.149 

.143 

.152 

,162 

.140 

.111 

.138 

.145 

.146 



.48 

,47 

.54 

.50 

n 

.50 

.48 

.49 

.61 

.50 

.52 


v 
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TABLE V 1 I j j j 

Results of simulations at p • 0.057 (R ■ I.IRg) 
(1000 decoded frames). 
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T 


L 


K 


L, K and T in terms of branches. 


Fig. 2. 

An example of a binary trellis code with rate 1/2, memory length 
2 and tail length 1 for a binary input channel. 
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Same slope as theoretically 
expected for Ry - 0.50 


M » S (30 000 decoded frames] 


M = B (30 000 decoded frames! 


M » 10 (60 000 decoded frames) 


M ° 12 (60 000 decoded 

— — ^ frames 


16 (GO 000 
docoded 
frames ) 


The decoding error probability obtained from sequential decoding 
simulations versus the tail length of the convolutional code. 





Same slops as theoretically 
expected for Ry ■ 0.50 


' M ,■ 6 (30 000 decoded franes) 

"■© 9 ; — e- 


M ■ 0 (30 000 decoded frames) 

i - v 

a a n □ 



M ° ID (CO 000 decoded frames) 
t> 1> t > 


12 (BO 000 decoded 
a a f ramss 


\ ft - 16 (B0 000 
\ decoded 

\ frames ) 

\ 


T 


10 


12 


14 


16 


Fig. 4. 

The decoding error probability obtained from sequential decoding Simula- 
tions versus the tail length of the convolutional code,(L ■ 12Q* p ■ 0.0^ 



It 



M ■ 6 (30 000 decoded frames) 


M * Q (30 000 decoded frames) 


Same slope as theoretically 
expected for Rj. » 0»50 


II n 10 (60 000 decoded frames) 


M - 12 t 60 000 decoded 
: A frame 


\ M « 16 (60 ono 

\ * decoded 

\ frames } 


Fig. 5. • 

The decoding error probability obtained from sequential decoding simula 

tions versus the tail length of the convolutional code. 







